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PROCESSING UNIT AND PROCESSING METHOD 

CROSS-REFERENCE TO RELATED APPLICATIONS 
5 This Application is a divisional of U.S. Patent 
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2 0 0 2 , which is a continuation of U.S. Patent 
Application No . 0 9/ 97 4 , 8 07 , f i 1 e d o n 0 c t obe r 1 2 , 2 0 0 1 , 
which is a division of U.S. Patent Application No. 

10 0 9/ 14 7 , 6 63 , filed on February 9, 19 9 9, which is the 
National Stage of International Application No. 
PCT/JP98 /02 90 9 , f i 1 e d on June 2 9 , 1 9 98, the contents 
of which are incorporated by reference herein in their 
entireties. The International Application was not 

15 published in English. 

BACKGROUND OF THE INVENTION 

1. Field of the Invention 

20 The present invention relates to a processing 

unit, which is incorporated into a mobile 
communication apparatus, for performing an ASC 
(Addition, Comparison, and Selection) operation of 
particularly a Viterbi decoding. 

25 

2. Background Information 

In data communications in a mobile radio 



communication network, since a bit error frequently 
occurs, an execution of an error correction 
processing is needed. In the error correction 
methods, there is a method in which a c on vo 1 u t i on a 1 
5 code generated f rom an input bit is decoded by Viterbi 
decoding on a receiver side. In the error correction 
processing, a digital signal processor (hereinafter 
referred to as "DSP" ) is used. 

The Viterbi decoding repeats the simple 

10 processing such as addition, comparison, and 
selection and performs a trace-back operation for 
finally data, thereby realizing a maximum likelihood 
decoding of the convolut ional code. 

The following will briefly explain the Viterbi 

15 decoding processing. The c on vo 1 u t i ona 1 code is 
generated by mode 2 addition of input bits and a fixed 
number of bits precedent thereto. Then, a plurality 
of coding data is generated to correspond to one bit 
of the input bits. A number of input information 

20 bits having influence upon the coding data is called 
constraint length (K) . The number of input 
information bits is equal to a number of stages of 
shift registers used in mode 2 addition. 

The coding data is determined by the input bits 

25 and a state of the preceding (K-l) input bits. When 
a new information bit is input, the state of the input 
bits transits to a new state. The state in which 
coding data transits is determined by whether the 



new input bit is " 0 " or " 1 . " Since the respective 
(K-l) bits are "1" or "0", a number of states in which 
coding data transits becomes 2 <K_1) . 

In the Viterbi decoding, received coding data 
5 sequence is observed, and the most-likely state is 
estimated from all obtainable state transitions. 
For this reason, every time when coding data (received 
data sequence) corresponding to one bit of 
information bits, an inter-signal distance (metric) 

10 of the respective paths to each state at that point 
is computed. Then, operations for leaving a path 
having a smaller metric among the paths reaching the 
same state as a survivor are sequentially repeated. 
As shown in a state transition diagram of FIG. 

15 1, in a c on vo 1 u t i on a 1 encoder having a constraint 
length K, two paths each showing a state transition 
fromeach of state S[n] and S [n + 2 (K " 2) ] at one previous 
point extend to a state S[2n](n= positive integer) 
at a certain point. For example, in a case of K = 

20 3, a transition from each of S[l] (state SOI) and 
S[3] (state Sll) to S[2] (state S10) (state in which 
preceding two bits are input in order of " 1 " and " 0 " ) 
at the time of n = 1 is possible. Also, at the time 
of n = 2, a transition from each of S[2] (state S10) 

25 and S[4] (state S00) to S[4] (state S00) (state shown 
by low-order two bits) is possible. 

A path metric "a" is a sum of an inter-signal 
distance ( b r a n c h me t r i c ) "x" between an output symbol 
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of the path inputting to the state S[2n] and the 
received data sequence and a path metric "A." The 
path metric "A" is the total sum of branch metrics 
of the survivor paths up to the state S[n] at one 
5 previous state. Similarly, a path metric "b" is a 
sum of an inter-signal distance (branch metric) "y" 
between an output symbol of the path inputting to 
the state S[2n] and the received data sequence and 
a path metric U B." The path metric "B" is the total 

10 sum of branch metrics of the survivor paths up to 
the state S[n + 2 (K ~ 2) ] at one previous point. In the 
Viterbi decoding, the path metrics "a" and "b" 
inputting to the state S[2n] are compared with each 
other, and the smaller path is selected as a survivor 

15 path. 

In the Viterbi decoding, each processing of 
addition for obtaining the path metric, comparison 
between the path metrics and the selection of path 
is executed with respect to 2 {K ~ 2) states at each point. 
20 Moreover, in the selection of path, a history showing 
which path has been selected is left as a path select 
signal PS[i], [1= 0 to 2 (K_2) -1]. 

At this time, if a subscript (e.g., n) of one 
previous state of the selected path is smaller than 
25 a subscript (n + 2 (K_2) ) of one previous state of the 
non-selected other path , PS[i]=0 is established. If 
the subscript (n) it is larger than the subscript 
(n + 2 (K_2) ), PS[i]=l is established. 



In the case of FIG. 1, since n<(n + 2 (K ~ 2) ) is 
established, the state S[n + 2 (K ~ 2) ] is selected at the 
time of a>b and PS[S2n]=l is established, and the 
state S[n] is selected at the time of a^b and 
5 PS[S2n]=0 is established. 

Then, in the Viterbi decoding, data is decoded 
while being traced back to the path finally survived 
based on the path select signal. 

The following will explain the conventional 
10 processing unit for Viterbi decoding, TMS320C54x, 
which is a general processing unit, (manufactured 
by TEXAS INSTRUMENTS, hereinafter referred to as 
"C54x") being given as one example . I n a GSM ce 1 1 u 1 a r 
radio system, equation (1) set forth below is used 
15 as a convolut ional code. 

Gl (D) = 1 + D3 + D4 

G2 (D) = 1 + D + D3 + D4 ... ( 1 ) 

The above convolutional code is expressed by 
a trellis diagram of a butterfly structure shown in 
20 FIG. 2. The trellis diagram shows a state in which 
the convolutional code transits from a certain state 
to another state. Let us assume that constraint 
length K is 5. States of 2 (K ~ 2) = 16 or 8 butterfly 
structures are present for each symbol section. 
25 Then, two branches are input in each state, and a 
new path metric is determined by the ACS operations. 

The branch metric can be defined as the following 
equation ( 2 ) . 



M = SD(2*i)*B(J,0)+SD(2*i + l)*B(j,l) ... (2) 

where SD(2*i) denotes a first symbol of a symbol 
metric showing a soft decision input, and SD(2*i+l) 
denotes a second symbol of the symbol metric. 
5 B (J, 0) and B(j,l) conform to codes generated by a 
convolut ional encoder as shown in FIG. 3. 

In C54x, an arithmetic logic section 
(hereinafter referred to as "ALU" ) is set to a dual 
16-bit mode, thereby processing the butterfly 

10 structure at high speed. The determination of a new 
path metric (j) can be obtained by calculating two 
path metrics (2*J and 2*J+1) and the branch metrics 
(M and - M ) in parallel based on a DSADT instruction 
and executing a comparison based on a CMPS instruction . 

15 The determination of a new path metric (j+8) can be 
obtained by calculating two path metrics and the 
branch metrics (M and -M ) in parallel based on the 
DSADT instruction. The calculation results are 
stored in high and low order bits of a 

20 double-precision accumulator, respectively. 

The CMPS instruction compares the high and low 
order bits of the accumulator and stores a larger 
value in a memory. Also, every time when the 
comparison is executed, which value is selected is 

25 written in a 16-bit transition register (TRN) . The 
content written to the TRN is stored in the memory 
every time when each symbol processing is ended. 
Information to be stored in the memory is used to 
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search a suitable path in the trace-back processing. 
FIG. 4 shows a macro program for a butterfly operation 
of the Viterbi decoding. 

The values of the branch metrics are stored in 
5 the T register before the macro is called. FIG. 5 
shows an example of a memory mapping of the path 
metrics . 

8 butterfly operations are executed in one 
symbol section and 16 new states are obtained. This 

10 series of processing is repeatedly computed over 
several sections. After the end of the processing, 
the trace-back is executed so as to search a suitable 
pat h from 1 6 paths . Thereby, a decoding bit sequence 
can be obtained . 

15 The mechanism of the ACS operations of the C54x, 

which is the general DSP, can be thus explained. Then, 
in C54x, and the updates of two path metrics are 
realized with 4 machine cycles from the example of 
the macro program of FIG. 4. 

20 In the future, there is expected an increase 

in demand for non-voice communications requiring 
high quality transmission with a lower bit error rate 
than voice communications. As means for achieving 
the low bit error rate, there is means for increasing 

25 the constraint length K of the Viterbi decoding. 

However, if the constraint length is increased 
by a value corresponding to one bit, a number of path 
metrics (number of states) doubles. For this reason, 
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a number of operations in the Viterbi decoding using 
DSP double. Generally, an amount of information in 
non-voice communications is larger than the amount 
of information in voice communications. If the 
5 amount of information increases, the number of 
operations in the Viterbi decoding including the ACS 
operation also increases. An increase in number of 
operations using DSP makes it difficult to maintain 
a battery for a portable terminal for a long period 

10 of time . 

For the purpose of downsizing the portable 
terminal, reducing the we ight , and lowe ring the cos t , 
an area processed by a special LSI has been also 
designed to be implemented in one chip form using 

15 a DSP processing in recent years. 

However, an increase in the number of operations 
using DSP exceeds the processing capability of the 
existing DSP, thereby making it impossible to be 
implemented in one chip form using DSP. 

20 Moreover, if the function of DSP is highly 

enhanced to increase the number of operations, an 
increase in the cost of DSP itself is brought about. 
As a result, the reduction in the cost of the portable 
terminal cannot be realized. 

25 

A first object of the present invention is to 
provide a processing unit for efficiently processing 
an ACS operation of the Viterbi decoding by use of 



DSP with a small investment in software. The 
above object can be attained by arranging two pairs 
of comparing sections, an adding section, and a 
storing section for storing a comparison result in 
5 the processing unit and by executing the ACS operation 
in parallel . 



BRIEF DESCRIPTION OF THE DRAWINGS 
10 FIG. 1 is a trellis diagram showing a path of 

a state transition of a c on vo 1 u t i on a 1 encoder in 
Viterbi decoding; 

FIG. 2 is a schematic diagram showing a butterfly 
structure of the trellis diagram; 
15 FIG. 3 is a schematic view showing an example 

of codes generated by the convolutional encoder; 

FIG. 4 is a program view showing an example of 
a Viterbi operation for channel coding; 

FIG. 5 is a schematic view showing a pointer 
20 control and an example of path metric storage; 

FIG. 6 is a block diagram showing the structure 
of the processing unit of the first embodiment of 
the present invention; 

FIG. 7 is a block diagram showing an example 
25 of the convolutional encoder having a code rate 1/2; 

FIG. 8 is a schematic view showing the butterfly 
structure where a constraint length K = 4 ; 

FIG. 9 is a block diagram showing the structure 
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of the processing unit of the second embodiment of 
the present invention; 

FIG. 10 is a timing view explaining a pipe line 
operation of the processing unit of the second 
5 embodiment of the present invention; 

FIG. 11 is a schematic view showing an example 
of a memory access operation of RAM of the second 
embodiment of the present invention; 

FIG. 12 is a block diagram showing the structure 
10 of the processing unit of the third embodiment of 
the present invention; 

FIG. 13 is a schematic view showing an example 
of a memory access operation of a dual port RAM of 
the third embodiment of the present invention; 
15 FIG. 14 is a block diagram showing the structure 

of the processing unit of the fourth embodiment of 
the present invention; 

FIG. 15 is a timing view explaining a pipe line 
operation of the processing unit of the fourth 
20 embodiment of the present invention; 

FIG. 16 is a block diagram showing the structure 
of the processing unit of the fifth embodiment of 
the present invention; 

FIG. 17 is a view showing ACS operation results 
25 of the processing unit of the sixth embodiment of 
the present invention; 

FIG. 18 is a block diagram showing the structure 
of the processing unit of the sixth embodiment of 
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the present invention; 

FIG. 19 is a block diagram showing the structure 
of the processing unit of the seventh embodiment of 
the present invention; 
5 FIG. 20 is ablockdiagramshowingthe structure 

of the processing unit of the eighth embodiment of 
the present invention; 

FIG. 21 is an input/output view of a 4:2 
compressor of the eighth embodiment of the present 
10 invention; 

FIG. 22 is a block diagram showing the structure 
of the processing unit of the ninth embodiment of 
the present invention; 

FIG. 23 is a view showing a carry control of 
15 a double-precision A U ; 

FIG. 24 is a block diagram showing the structure 
of the processing unit of the tenth embodiment of 
the present invention; 

FIG. 25 is a block diagram showing the structure 
20 of the processing unit of the eleventh embodiment 
of the present invention; 

FIG. 26 is block diagram showing the structure 
of a mobile station apparatus of the twelfth 
embodiment of the present invention; 
25 FIG. 27 is a block diagram showing the structure 

of the mobile station apparatus of the thirteenth 
embodiment of the present invention; 

FIG. 28 a block diagram showing the structure 



of a base station apparatus of the fourteenth 
embodiment of the present invention; and 

FIG. 29 a block diagram showing the structure 
of the base station apparatus of the fifteenth 
5 embodiment of the present invention. 

DETAILED DESCRIPTION OF THE INVENTION 
Embodiments of the present invention will now 
be described with reference to the accompanying 
10 drawings. 

(First Embodiment ) 

FIG. 6 is a block diagram showing the structure 
of the processing unit of the first embodiment of 
the present invention. In FIG. 6, a path metric 
15 storing section 1 stores path metrics, and a data 
supply and a transfer of an operation result are 
executed via a bus 2 . A branch metric storing section 
3 stores branch metrics and a data supply is executed 
via a bus 4 . 

20 Comparing sections 5 and 9 compare data input 

from the path metric storing section 1 and the branch 
metric storing section 3 via the buses 2 and 4, 
respectively . 

Adding sections 6 and 10 add data read from the 
25 path metric storing section 1 and the branch metric 
storing section 3 via the buses 2 and 4, respectively. 

A comparison result storing section 7 stores 
a comparison result of the comparing section 5, and 
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a comparison result storing section 11 stores a 
comparison result of the comparing section 9. Then, 
the comparison result storing sections 7 and 11 
transfer the comparison results in the path metric 
5 storing section 1 via the bus 2. 

A selecting section 8 inputs an adding result 
of the adding section 6 and determines an output based 
on the comparison result of the comparing section 
5. A selecting section 12 inputs an adding result 

10 of the adding section 10 and determines an output 
based on the comparison result of the comparing 
section 9. Then, the selecting sections 8 and 12 
transfer the outputs determined based on the 
comparison results to the path metric storing section 

15 1 via a bus 1 3 . 

Next, the following will explain ACS operation 
of the processing unit of the first embodiment with 
reference to the drawings. In the explanation set 
forth below, it is assumed that data to be decoded 

20 is ones that are coded by a c on vo 1 u t i on a 1 encoder 
of FIG. 7 where a constraint length K=4 and a code 
rate 1/2. Also, data type of the path metrics and 
that of the branch metrics are single-precision data. 
Then, when d oub 1 e - p r e c i s i o n data is set to (X, Y) 

25 for the sake of convenience, a high order position 
of the double-precision data is set to X and a low 
order position thereof is set to Y. 

Four branch metrics are set to BM 0 , BMl, BM2 , 
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BM 3 , respectively. If a state transition is 
illustrated using these branch metrics, the 
butterfly structure is shown as in FIG. 8. 

Here, attention should be paid to nodes NO and 
5 Nl of an old state. The transition destinations of 
the nodes NO and Nl are nodes N'O and N'4, 
respectively . 

Then, a branch metric, which is obtained at the 
time of the transition from the node NO to the node 

10 N'O, is BMO, and a branch metric, which is obtained 
at the time of the transition from the node Nl to 
the node N'O, is BM1. Also, a branch metric, which 
is obtained at the time of the transition from the 
node NO to the node N'4, is BM1, and a branch metric, 

15 which is obtained at the time of the transition from 
the node Nl to the node N'4, is BMO. 

Thus, the path metric PMO of the node NO and 
the path metric PMl of the node Nl are replaced with 
branch metrics BMO and BM1, respectively, and these 

20 metrics are added. Thereby, path metric PM'O of the 
node N'O and path metric PM'4 of the node N'4 are 
obtained . 

Then, this relationship can be applied to the 
other pairs of nodes (a pair of nodes N2 and N3, a 
25 pair of nodes N4 and N5, a pair of nodes N6 and N7) . 

The inventor- of the present invention paid 
attention to this relationship, and found out that 
two path metrics could be updated simultaneously by 



processing the ACS operation in parallel and that 
processing time could be reduced. This led to the 
present invention. 

The ACS operation of the node N'O to N'3 in the 
5 first half is executed by the comparing section 5, 
the adding section 6, the comparison result storing 
section 7, and the selecting section 8. In parallel 
with this operation, the ACS operation of the node 
N'4 to N'8 in the second half is executed by the 

10 comparing section 9, the adding section 10, the 
comparison result storing section 11, and the 
selecting section 12. The following will 

specifically explain the ACS operation from nodes 
NO and Nl to nodes N'O and N'4. 

15 First, two path metrics (PM1, PMO) are output 

to the bus 2 from the path metric storing section 
1. On the other hand, two branch metrics (BMl, BMO) 
are output to the bus 4 from the branch metric storing 
section 3 . 

20 The comparing section 5 inputs two path metrics 

(PM1, PMO) from the bus 2 and two branch metrics (BMl, 
BMO) from the bus 4 so as to calculate 
PM1+BM1-PM0-BM0 . 

The adding section 6 inputs two path metrics 

25 (PM1, PMO) from the bus 2 and two branch metrics (BMl, 
BMO) from the bus 4 so as to calculate PM1 + BM1 and 
PM0 + BM0. Then, t h e c a 1 cu 1 a t i on r e s u 1 1 s (as PM1 + BM1, 
PM0+BM0) are output to the selecting section 8. 
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The selecting section 8 inputs the most 
significant bit (hereinafter referred to as "MSB" ) 
which is the code bit of the comparison result of 
the comparing sect ion 5 , PM 1 + BM 1 - PM 0 - BM 0 . Then, the 
selecting section 8 selects as to whether the high 
order PM1 + BM1 is output to the bus 13 or the low order 
PMO + BMO is output thereto from the value of the MSB. 

In other words, if the equation (3) shown below 
is established, the MSB is 0 and the selecting section 
8 outputs the low order PMO + BMO to the bus 13 as PM'O. 
Conversely, if the equation (3) is not established, 
the MSB is 1 and the selecting section 8 outputs the 
high order PM1+BM1 thereto as PM'O. 

PM1 + BM1 ^ PMO + BMO ... (3) 

Also, the MSB, which is the comparison result 
of the comparing section 5, is stored in the 
comparison result storage section 7 at the same time. 

The comparing section 9 inputs two path metrics 
(PM1, PMO) from the bus 2 and two branch met rics (BM1, 
BM 0 ) from the bus 4 so as to calculate 
PM1+BM0-PM0-BM1 . 

The adding section 10 inputs two path metrics 
(PM1, PMO) from the bus 2 and two branch me tries (BM1, 
BMO) from the'bus 4 so as to calculate PM1 + BM0 and 
PM0 + BM1. Then, the calculation results (as PM1 + BM0, 
PM0+BM1) are output to the selecting section 12. 

The selecting section 12 inputs the MSB of the 
comparison result of the comparing section 9, 



PM1+BM1- PMO-BMl. Then, the selecting section 12 
selects as to whether the high order PM1 + BM0 is output 
to the bus 13 or the low order PMO + BMl is output thereto 
from the value of the MSB. 
5 In other words, if the equation (4) shown below 

is established, the MSB is 0 and the selecting section 
12 outputs the low order PMO+BMl to the bus 13 as 
PM'4. Conversely, if the equation (4) is not 
established, the MSB is 1 and the selecting section 

10 12 outputs the high order PM1+BM0 thereto as PM'4. 
PM1 + BM0 ^ PMO + BMl ... (4) 

Also, the MSB, which is the comparison result 
of the comparing section 9, is stored in the 
comparison result storage section 11 at the same time . 

15 The above processing is subjected to the other 

node pairs in the same way. As a result, the ACS 
operation of the Viterbi coding using DSP can be 
executed in parallel and the operation processing 
can be performed with relatively a small amount of 

20 processing at high speed. 

The above embodiment explained the case of the 
constraint length K = 4 and the code rate 1/2 . However, 
even if the constraint length and the code rate are 
the other values, the above relationship is 

25 established. Therefore, the change corresponding 
thereto is suitably provided, so that the same 
advantage can be obtained. 
( Second Embodiment ) 
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FIG. 9 is a block diagram showing the structure 
of the processing unit of the second embodiment of 
the present invention. In the processing unit of 
FIG. 9, the same reference numerals are added to the 
5 portions common to the processing unit of FIG. 6 and 
the explanation is omitted. 

In the processing unit of FIG. 9, the storing 
section for storing the path metrics is formed by 
a RAM 14 having four banks. 
10 The processing unit of FIG. 9 is suitable for 

the operation processing of a pipeline structure 
shown in FIG . 10 . 

For example, for executing the ACS operation 
at an operation execution stage of n-th + 1 cycle 
15 in an instruction 1, it is required that addresses 
of the path metrics to be read at a memory access 
stage of n-th cycle should be supplied to the RAM 
14 in advance . 

It is assumed that the RAM 14 is a 
20 double-precision readable RAM that can read an even 
address and an odd address continuously. Then., if 
the following conditions (a) and (b) are satisfied, 
two path metrics used in the operation can be read 
by only designating the even address. 
25 (a) The path metrics of one state are stored 

at continuous addresses in order of the even address 
and the odd address. 

(b) The path metrics of one state are divided 
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into the first and second halves, and each is stored 
in a different bank. 

For example, the path metrics (PMO, PMl, PM2 , 
PM3 in FIG. 8) of the first half of the old state are 
5 stored in the bank 0 of the RAM 14. Then, the path 
metrics (PM4, PM5, PM6, PM7 in FIG. 8) of the second 
half of the old state are stored in the bank 1. In 
this case, two path metrics are generated by executing 
the ACS operation at one cycle, and these metrics 

10 are stored in banks 2 and 3 via thebus 13, respectively. 
At this time, d ou b 1 e - p r e c i s i on data is transferred 
from the bus 13, the path metric of the node N'3 is 
stored in the bank 2 from the node N'O, and the path 
metric of the node N'7 is stored in the bank 3 from 

15 the node N ' . L 

FIG. 11 is a schematic view showing an example 
of a memory access operation of the RAM 14 
corresponding to FIG. 8. 

When the ACS operation of one state is ended, 

20 in a next state, the path metrics of the old state 
are read from the banks 2 and 3 and the path metrics 
of a new state are stored in the banks 0 and 1. 

Thus, every time when the ACS operation of one 
state is ended, the pair of banks for reading the 

25 path metrics and the pair of banks for storing the 
path metrics are switched using RAM 14 having four 
banks as the storing section for storing the path 
metrics. Thereby, the ACS operation of the Viterbi 
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decoding using DSP can be executed in parallel. 

In the above explanation, the banks 0 and 1 and 
the banks 2 and 3 were paired, respectively. However, 
even if the other combinations are used, the similar 
5 operation can be executed by only changing the address 
to be used in supplying the metrics at the memory 
access stage and the address to be used in storing 
the metrics. Moreover, in the second embodiment, 
the RAM 14 was formed by four banks. However, the 
10 similar operation can be executed if the number of 
banks is more than four. 
(Third Embodiment) 

FIG. 12 is a block diagram showing the structure 
of the processing unit of the third embodiment of 
15 the present invention. In the processing unit of 
FIG. 12, the same reference numerals are added to 
the portions common to the processing unit of FIG. 
6 and the explanation is omitted. 

In the processing unit of FIG. 12, the storing 
20 section 3 for storing the path metrics is formed by 
a dual RAM 15 having three banks. 

The processing unit of FIG. 12 is suitable for 
the operation processing of the pipe line structure 
shown in FIG . 1 0 . 
25 Since the storing section for storing the path 

metrics is the dual port RAM 15 in the processing 
unit of FIG. 12, designation of reading and writing 
to the same bank can be executed with one instruction. 
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For example, for executing the ACS operation at an 
operation execution stage of n-th + 1 cycle in an 
instruction 1, an address for reading the path metric 
at a memory access stage of n-th cycle and an address 
5 for writing the path metric are supplied to the dual 
port RAM 15. Thereby, at the n-th +1 cycle, an even 
address- and an odd address can be continuously read 
from the dual port RAM 15 so as to execute the ACS 
operation. Moreover, one path me trie can be written 

10 to the same bank. 

In the processing unit of the third embodiment, 
if the following conditions (a) and (b) a r e s a t i s f i e d , 
two path metrics used in the operation can be read 
by only designating the even address. 

15 (a) The path metrics of one state are stored 

at continuous addresses in order of the even address 
and the odd address. 

(b) The path metrics of one state are divided 
into the first and second halves, and each is stored 

20 in a different bank. 

For example, the path metrics (PMO, PMl, PM2, 
PM3 in FIG. 8) of the first half of the old state are 
stored in the bank 0 of the dual port RAM 15, and 
the path metrics (PM4, PM 5 , PM6, PM7 in FIG. 8) of 

25 the second half of the old state are stored in the 
bank 1. In this case, two path metrics are generated 
by executing the ACS operation at one cycle, and these 
metrics are stored in banks 0 and 2 via the bus 13, 
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respectively. At this time, the bus 13 transfers 
double-precision data, the path metric of the node 
N ' 3 is stored in the bank 0 from the node N'O, and 
the path metric of the node N'7 is stored in the bank 
5 2 from the node N' 4 . 

FIG. 13 is a schematic view showing an example 
of a memory access operation of the RAM 15 
corresponding to FIG. 8. 

In the processing unit of FIG. 12, when the ACS 
10 operation of one state is ended, only the banks 1 
and 2 are switched. Then, the ACS operation of the 
Viterbi decoding using DSP can be executed in parallel 
without switching the bank 0. 

In the third embodiment, the dual port RAM 15 
15 was formed by three banks. However, the similar 
operation can be executed if the number of banks is 
more than three. 

(Fourth Embodiment ) 

FIG. 14 is a block diagram showing the structure 
20 of the processing unit of the fourth embodiment of 
the present invention. In the processing unit of 
FIG. 14, the same reference numerals are added to 
the portions common to the processing unit of FIG. 
6 and the explanation is omitted. 
25 The processing unit of FIG. 14 comprises input 

registers 16 and 17 for inputting data from the bus 
2 and for outputting data to the comparing sections 
5, 9, and the adding sections 6, 10. 
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The processing unit of FIG. 14 is suitable for 
the operation processing of the pipe line structure 
shown in FIG . 15 . 

For example, for executing the ACS operation 
5 at an operation execution stage of n-th + 2 cycle 
in an instruction 1, an address for reading the path 
metric at an memory access stage of n-th cycle is 
supplied to the RAM 14 in advance. Then, data output 
from the RAM 14 is latched to the input registers 
10 16 and 17 via the bus 2 at a data transfer stage of 
n-th + 1. 

The pipe shown in FIG. 15 is structured so that 
one data transfer stage is inserted between a memory 
access stage and an operation execution stage of the 

15 pipe line shown in FIG. 10. In other words, data 
output from the RAM 14 is determined at the input 
registers placed at the front of the respective 
operation devices (comparing sections 5, 9, and 
adding sections 6, 10) at a starting point of the 

20 operation execution stage. As a result, time 
required for data transfer from the RAM 14 can be 
omi t ted. 

Therefore, according to this embodiment, the 
ACS operation of the Viterbi decoding using DSP can 
25 be executed in parallel at relatively high speed. 
Note that the similar operation can be executed if 
the dual'port RAM is used as the storing section for 
storing the path metrics. 
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(Fifth Embodiment.) 

FIG. 16 is a block diagram showing the structure 
of the processing unit of the fifth embodiment of 
the present invention. In the processing unit of 
5 FIG. 16, the same reference numerals are added to 
the portions common to the processing unit of FIG. 
14 and the explanation is omitted. 

In the processing unit of FIG . 16, a swap circuit 
18 is added as compared with the processing unit of 

10 FIG. 14. The swap circuit 18 directly outputs data 
input from the branch metric storing section 3 or 
swaps the high order position and the low order 
position so as to be output. 

The processing unit of FIG. 16 is suitable for 

15 the operation processing of the pipe line structure 
shown in FIG. 15. 

For example, let us assumed that data is input 
as double-precision data in a form of { BMl , BMO } from 
the branch metric storage 3. In this case, the swap 

20 circuit 18 has a function of switching whether values 
of two branch metrics are directly output as {BMl, 
BMO} or the high order position and the low order 
position are swapped so as to be output as {BMO, BMl} 
by an instruction. 

25 The following will explain an operation of the 

swap circuit 18 using the c o nvo 1 u t i o na 1 encoder of 
FIG. 7 and the path metric transition state of the 
butterfly structure of FIG. 8 where the constraint 
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length K=4 and the code rate is 1/2. 

As shown in FIG. 17, the ACS operation, which 
is executed at the time of the transition from the 
nodes NO and Nl of the old state to the nodes N'O 
5 and N'4, and the ACS operation, which are executed 
at the time of the transition from the nodes N6 and 
N7 of the old state to the nodes N'3 and N' 7 , are 
compared with each other. As a result, in both ACS 
operations, common branch metrics BMO and BM1 are 

10 used and the relationship in which BMO and BM1 are 
swapped is established. 

The ACS operation, which is executed at the time 
of the transition from the nodes NO and Nl to the 
node N'O, and the ACS operation, which is executed 

15 at the time of the transition from the nodes N6 and 
N7 to the node N'3 are performed by the comparing 
section 5 and the adding section 6. On the other 
hand, the ACS operation, which is executed at the 
time of the transition from the nodes N3 and Nl to 

20 thenodeN'4, a n d t h e AC S ope r a t i on , wh i c h i s e x e c u t e d 
at the time of the transition from the nodes N6 and 
N7 to the node N'7, are performed by the comparing 
section 9 and the adding section 10. 

For this reason, if the branchmetrics are stored 

25 in the branch metric storing section 3 in both forms 
of {BMO, BM 1 } and { BM 1 , BMO}, the branch metric 
storing section 3 results in a redundant hardware 
source . 
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The swap circuit 18 is used to solve such 
redundancy. For example, the branch metrics are 
stored in the branch metric storing section 3 in only 
the form of { BM 0 , BM 1 } . Then, the metrics in the 
5 form of { BM 0 , MB 1 } are input to the swap circuit 18. 
The swap circuit 18 swaps the metrics in the form 
of { BM 0 , BM 1 } or the metrics in the form of { BM 1 , 
BM 0 } so as to be output by an instruction. Thereby, 
redundancy of the branch metric storing section 3 

10 can be omitted. 

The above embodiment was explained using the 
nodes NO, Nl, N6, N7 of the old state where the 
constraint length K=4 and the code rate was 1/2. 
However, the aforementioned relationship can be 

15 established using even the nodes N2 , N3, N4, N5. Also, 
the aforementioned relationship can be established 
using the other combinations of the constraint length 
K and the code rate. Therefore, the similar 
operation can be executed. Moreover, the similar 

20 operation can be executed even if the dual port RAM 
is used as the storing section for storing the path 
met r i c s . 

(Sixth Embodiment) 

FIG. 18 is a block diagram showing the structure 
25 of the processing unit of the sixth embodiment of 
the present invention. In the processing unit of 
FIG. 18, the same reference numerals are added to 
the portions common to the processing unit of FIG. 
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16 and the explanation is omitted. 

As compared with the processing unit of FIG. 
16, in the processing unit of FIG. 18, the comparing 
section 5 comprises adders 19, 20, and a comparator 
5 21, and the adding section 6 comprises adders 22 and 
23. Also, the comparing section 9 comprises adders 
2 4 , 2 5 , and a comparator 26, and the adding section 
10 comprises adders 27 and 28. 

In FIG. 18, the adders 19 and 20 input data from 

10 the bus 4 and the input register 16 and add these 
input data. The comparator 21 inputs addition 
results from the adders 19 and 20 and compares the 
addition results, and outputs a comparison result 
to the comparison result storing section 7 and the 

15 selecting section 8. The adders 22 and 23 input data 
from the bus 4 and the input register 16 and add these 
input data, and output addition results to the 
selecting section 8. 

The adders 24 and 25 input data from the bus 

20 4 and the input register 17 and add these input data. 
The comparator 26 inputs addition results from the 
adders 24 and 25 and compares the addition results, 
and outputs a comparison result to the comparison 
result storing section 11 and the selecting section 

25 12. The adders 27 and 28 input data from the bus 
4 and the input register 17 and add these input data, 
and output addition results to the selecting section 
12 . 
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The processing unit of FIG. 18 is suitable for 
the operation processing of the pipe line structure 
shown in FIG . 15 . 

Next, the ACS operation of the sixth embodiment 
5 will be explained. This explanation will be given 
using the convolut iona 1 encoder of FIG. 7 and the 
butterfly structure of FIG. 8 where the constraint 
length K=4 and the code rate is 1/2, and the ACS 
operation result of FIG. 17. 

10 As shown in FIG. 18, two metrics are output as 

{A, B} from the input registers 16 and 17, and two 
branch metrics are output as {C, D} from the swap 
circuit 18. At this time, the adder 19 inputs the 
path metric {A} and the branch metric {C} , and outputs 

15 an addition result {A+C}. The adder 20 inputs the 
path metric {B} and the branch metric {D}, and outputs 
an addition result {B+D}. The comparator 21 inputs 
the addition result {A+C} of the adder 19 and the 
addition result {B+D} of the adder 20, compares 

20 {A + C- (B+D)}, and outputs the MSB of the comparison 
result. The adder 22 inputs the path metric {A} and 
the branch metric {C}, and outputs the addition result 
{A + C}. The adder 23 inputs the path metric {B} and 
the branch metric {D} , and outputs the addition result 

25 { B+D} . 

On the other hand, the adder 24 inputs the path 
metric {A} and the branch metric {D}, and outputs 
an addition result {A+D}. The adder 25 inputs the 
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pathmetric {B} a n d t he b r a n ch me t r i c {C}, and outputs 
an addition result {B+C}. The comparator 26 inputs 
the addition result { A+ D } of the adder 24 and the 
addition result {B+C} of the adder 25, compares 
5 {A+D-(B+C)}, and output the MSB of the comparison 
result. The adder 27 inputs the path metric {A} and 
the branch me trie {D} , and outputs the addition result 
{A+D} . The adder 28 inputs the path metric {B} and 
the branch metric {C} , and outputs the addition result 
10 { B+C } . 

By the above structure and the operation, if 
two path metrics of the input registers 16 and 17 
are set to { A , B } = { PM 1 , PM 0 } and the outputs of the 
swap circuit 18 are set to {C, D} = {BM1,BM0} , the ACS 

15 operation, which is executed at the time of the 
transition from the nodes NO and Nl of the old state 
to the nodes N'O and N' 4 , can be realized. 

Also, if two path metrics of the input registers 
16 and 17 are set to { A , B } = { PM 1 , PM 0 } and the outputs 

20 of the swap circuit 18 are set to { C , D } = { BM 0 , BM 1 } , 
the ACS operation, which is executed at the time of 
the transition from the nodes NO and Nl of the old 
state to the nodes N'O and N'4, can be realized. 

Therefore, according to the sixth embodiment, 

25 the update of two path metrics can be realized at 
one machine cycle by the pipe line operation using 
DSP. The above embodiment was explained using the 
nodes NO, Nl, N6, N7 of the old state where the 
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constraint length K=4 and the code rate was 1/2. 
However, the aforementioned relationship can be 
established using even the nodes N2, N3, N4, N5. Also, 
the aforementioned relationship can be established 
5 using the other combinations of the constraint length 
K and the code rate. Therefore, the similar 
operation can be executed. Moreover, the similar 
operation can be executed even if the dual port RAM 
is used as the storing section for storing the path 
10 met r i c s . 

(Seventh Embodiment ) 

FIG. 19 is a block diagram showing the structure 
of the processing unit of the seventh embodiment of 
the present invention. In the processing unit of 

15 FIG. 19, the same reference numerals are added to 
the portions common to the processing unit of FIG. 
18 and the explanation is omitted. 

As compared with the processing unit of FIG. 
18, in the processing unit of .FIG. 19, an arithmetic 

20 logic section (hereinafter referred as U ALU' ; ) 29 is 
used in place of the comparator 21. Then, the 
processing unit of FIG. 19 comprises input registers 
30, 31, buses 32 , 33 , 37 , 3 8 , and selectors 34 and 
35 . 

25 In FIG. 19, the register 30 inputs data from 

the RAM 14 via the bus 37. The register 31 inputs 
data from the RAM 14 via the bus 38. The buses 32 
and 33 input data from a register file 36. The 
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selector 34 selects an output of input data from the 
bus 32, the adder 19, and the input register 30. The 
selector 35 selects an output of input data from the 
bus 33, the adder 20, and the input register 31. The 
5 ALU 29 inputs data from the selectors 34 and 35 and 
executes an arithmetic logic operation, and outputs 
a result of the arithmetic logic operation to the 
bus 13. Also, the ALU 29 outputs the MSB of the 
result of the arithmetic logic operation to the 
10 comparison result storing section 7 and the selecting 
section '8. 

The processing unit of FIG. 19 is suitable for 
the operation processing of the pipe line structure 
shown in FIG. 15. 

15 In the case where the ALU 29 performs the ACS 

operation, the selector 34 selects an output of the 
adder 19 and inputs the selected output to the ALU 
29. The selector 35 selects an output of the adder 
20 and inputs the selected output to the ALU 29. Then, 

20 the ALU 29 subtracts input two data, and the MSB of 
the subtraction result to the comparison result 
storing section 7 and the Selecting section 8. 

In the case where the ALU 29 performs the 
arithmetic logic operation between the 

25 register-register, the selectors 34 and 35 select 
the buses 3 2 and 3 3 , respectively. Then, data, which 
is output to the buses 32 and 33 from the register 
file 36, is input to the ALU 29. 
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Also, in the case where the ALU 29 performs the 
arithmetic logic operation between the 

register-memory, the selectors 34 and 35 select the 
bus 32 and the input register 31, respectively. Then, 
5 data, which is output to the bus 32 from the register 
file 36, and data, which is output to the input 
register 31 from the RAM 14 via the bus 38, are input 
to the ALU 2 9 . 

Conversely, in the case where the ALU 2 9 performs 

10 the arithmetic logic operation between the 
memory-register, the selectors 34 and 35 select the 
input register 30 and the bus 33 , respectively. Then, 
data, which is output to the register 30 from the 
RAM 14 via the bus 37, and data, which is output to 

15 the bus 33 from the register file 36, are input to 
the ALU 29. 

Also, in the case where the ALU 29 performs the 
arithmetic logic operation between the memory 
-memory, the selectors 34 and 35 select the input 
20 registers 30 and 31, respectively. Then, data, 
which is input to the input registers 30 and 31 from 
the RAM 14 via the buses 37 and 38, is input to the 
ALU 29. 

Thus, according to the seventh embodiment, for 
25 implementing the processing unit in an LSI form, one 
of the comparators for executing the ACS operations 
is used as ALU. Thereby, a chip area can be decreased, 
and the manufacturing cost can be reduced. Note that 
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the similar operation can be executed even if the 
dual port RAM is used as the storing section for 
storing the path metrics. 
(Eighth Embodiment ) 
5 FIG. 20 is a blockdiagramshowing the structure 

of the processing unit of the eighth embodiment of 
the present invention. In the processing unit of 
FIG. 20, the same reference numerals are added to 
the portions common to the processing unit of FIG. 

10 19 and the explanation is omitted. 

As compared with the processing unit of FIG. 
19, in the processing unit of FIG. 20, two adders 
19 and 20 are formed by a 4:2 compressor 39, and two 
adders 24 and 25 are formed by a 4:2 compressor 40. 

15 In the 4 : 2 compressors 39 and 40, s i ng 1 e b 1 o c k s , shown 
in FIG. 21, corresponding to a number of single 
precision bits, are connected in series. The 4:2 
compressors 39 and 40 execute an addition processing 
at higher speed than the general full adders. 

20 In FIG. 20, the 4:2 compressor 39 inputs data 

- from the bus 4 and the input register 16, and outputs 
an operation result to the selectors 34 and 35. The 
4:2 compressor 40 inputs data from the bus 4 and the 
input register 17, and outputs an operation result 

25 to the comparator 26. 

The processing unit of FIG. 20 is suitable for 
the operation processing of the pipe line structure 
shown in FIG. 15. 



Next, the ACS operation of the eighth embodiment 
will be explained. This explanation will be given 
using the convolut ional encoder of FIG. 7 and the 
butterfly structure of FIG. 8 where the constraint 
5 length K=4 and the code rate is 1/2, and the ACS 
operation result of FIG. 17. 

First of all, two metrics are output as {A, B} 
from the input registers 16 and 17, and two branch 
metrics are output as {C, D} from the swap circuit 
10 18 . 

Then, the 4:2 compressor 39 inputs the path 
metric {A} and the branch metric {C}, a reverse {~B} 
for path metric {B}, and a reverse { ~~D } for branch 
metric D, and outputs {A + C} and {B+D}. Two outputs 

15 {A + C} and {B + D} of the 4:2 compressor 39 are input 
to the ALU 29 via the selectors 34 and 35 so as to 
be added. In this case, to realize two complements 
{B} and {D}, " 1 " is input to the 4:2 compressor 39 
and the least significant carry input of the ALU 29. 

20 As a result, {A + C-(B + D)} is obtained and the MSB is 
output from the ALU 29. 

Also, the adder 22 inputs the path metric {A} 
and the branch metric {C}, and outputs the addition 
result {A+C}. Similarly, the adder 23 inputs the 

25 pathmetric {B} a n d t h e b r a n ch me t r i c {D}, and outputs 
the addition result {B+D} . 

On the other hand, the 4:2 compressor 40 inputs 
the path metric {A} and the branch metric {D}, a 



« • 



reverse {~B} for path metric {B}, and a reverse {~C} 
for branch metric C, and outputs {A+C} and {B+D}. 
Two outputs {A+C} and {B+D} of the 4:2 compressor 
40 are input to the comparator 26 so as to be added. 
5 In this case, to realize two complements {B} and {C} , 
"1" is input to the 4:2 compressor 40 and the least 
significant carry input of the comparator 26. As 
a result, {A+D-(B+C)} is obtained and the MSB is 
output from the comparator 26. 

10 Also, the adder 27 inputs the path metric {A} 

and the branch metric {D}, and outputs the addition 
result {A + D} . Similarly, the adder 28 inputs the 
pathmetric {B} and the branch me t ri c {C}, and outputs 
the addition result {B+C}. 

15 By the above structure and the operation, if 

two path metrics { A , B } of the input registers 16 and 
17 are set to {PM1,PM0} and the outputs {C,D} of the 
swap circuit 18 are set to { BM 1 , BM 0 } , the ACS 
operation, which is executed at the time of the 

20 transition from the nodes NO and Nl of the old state 
of FIG. 17 to the nodes N'O and N'4, can be realized. 

Also, if two path metrics { A , B } of the input 
registers 16 and 17 are set to { PM 1 , PM 0 } and the 
outputs {C,D} of the swap circuit 18 are set to 

25 { BM 0 , BM 1 } , the ACS operation, which is executed at 
the time of the transition from the nodes NO and Nl 
of the old'state of FIG. 17 to the nodes N'O and N'4, 
can be realized. Therefore, the update of two path 
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metrics can be realized at one machine cycle by the 
pipe line operation using DSP. 

Thus, according to the eighth embodiment, the 
use of the 4:2 compressors as the comparing section 
for executing the ACS operation can realize the higher 
speed computation than the case using two adders. 
The above embodiment was explained using the nodes 
NO, Nl, N6, N7 of the old state where the constraint 
length K=4 and the code rate was 1/2. However, the 
aforementioned relationship can be established 
using even the nodes N2, N3, N4, N5. Also, the 
aforementioned relationship can be established 
using the other combinations of the constraint length 
K and the code rate. Therefore, the similar 
operation can be executed. Moreover, the similar 
operation can be executed even if the dual port RAM 
is used as the storing section for storing the path 
metrics . 

(Ninth Embodiment) 

FIG. 22 is a block diagram showing the structure 
of the processing unit of the seventh embodiment of 
the present invention. In the processing unit of 
FIG. 22, the same reference numerals are added to 
the portions common to the processing unit of FIG. 
20 and the explanation is omitted. 

As compared with the processing unit of FIG. 
20, in the processing unit of FIG. 22, 
double-precision adders 41 and 42 are used as adding 
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sections, and at least one of the adders uses a 
doub 1 e -pr e c i s i on AU 41. 

In FIG. 22, the double-precision AU 41 inputs 
data in a do ub 1 e - p r e c i s i on form from the input 
5 register 16 and the bus 4 and executes a 
double-precision arithmetic operation. The 
double-precision adder 42 inputs data in a 
double-precision form from the input register 17 and 
the bus 4 and executes a d oub 1 e - p r e c i s i on adding 

10 operation. The double-precision AU 41 outputs an 
operation result to the selecting section 8 and the 
bus 13, and the output of the double-precision adder 
42 is output to the selecting section 12. 

The processing unit of FIG. 22 is suitable for 

15 the operation processing of the pipe line structure 
shown in FIG . 15. 

For executing the ACS operation in the ninth 
embodiment, the d o ub 1 e - p r e c i s i o n AU 41 inputs two 
path metrics as {A, B} in a double-precision form 

20 from the input register 16. Then, the 

double-precision AU 41 inputs two branch metrics as 
{C, D} in a doubl e -pr e ci s i on form from the swap 
circuit 18 via the bus 4, and executes a 
doubl e - p re c i s ion addition. At this time, the 

25 double-precision AU 41, as shown in FIG. 23, forcibly 
zeros the carry from the bit position of the 
s ingl e - p r e c i s i on MSB to a next stage, and executes 
two additions of the path metrics and the branch 
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metrics, {A + C, B+D}, simultaneously. 

On the other hand, the d oub 1 e - p r e c i s i on adder 
42 inputs two path metrics as { A , B} in a 
double-precision form from the input register 17. 
5 Then, the double-precision adder 4 2 inputs two branch 
metrics as {D, C} in a double-precision form from 
the swap circuit 18 via the bus 4. Then, the 
double-precision adder 42 forcibly zeros the carry 
from the bit position of the s i ng 1 e - p r e c i s i on MSB 

10 to a next stage, and executes two additions of the 
path metrics and the branch metrics, {A+C, B+D}, 
simultaneously . 

Thus, according to the ninth embodiment, the 
double-precision AU 41 is used as the adding section 

15 for executing the ACS operation. At the time of the 
ACS operation, the d oub 1 e - p r e c i s i o n AU 41 forcibly 
zeros the carry from the bit position of the 
s ing 1 e -p re c i s i on MSB to the next stage. At the time 
of the double -pre ci s ion arithmetic operation other 

20 than the ACS operation, the control for propagating 
the carry is added. Thereby, for example, the 
double-precision AU 41 can be used as a 
doub 1 e - p r e c i s i on accumulation adder at the time of 
product and addition operations. Therefore, in the 

25 case of implementing the processing unit in an LSI 
form, the chip area can be further decreased, and 
the manufacturing cost can be reduced. Note that 
the similar operation can be executed even if the 
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dual port RAM is used as the storing section for 
storing the path metrics. 
(Tenth Embodiment ) 

FIG. 24 is a block diagram showing the structure 
5 of the processing unit of the tenth embodiment of 
the present invention. In the processing unit of 
FIG. 24, the same reference numerals are added to 
the portions common to the processing unit of FIG. 
22 and the explanation is omitted. 
10 As compared with the processing unit of FIG. 

22, in the processing unit of FIG . 20, shift registers 
43 and 44 are used as a comparison result storing 
section . 

In FIG. 24, the shift register 43 inputs the 
15 MSB of the operation result of the ALU 29 so as to 
be output to the bus 2. The shift register 44 inputs 
the MSB of the operation result of the comparator 
26 so as to be output to the bus 2. 

The processing unit of FIG. 24 is suitable for 
20 the operation processing of the pipe line structure 
shown in FIG. 15. 

For executing the ACS operation in the tenth 
embodiment, the BSM of the comparison result of the 
ALU 29 is shifted in the shift register 43 at any 
25. time. The BSM of the comparison result of the 
comparator 26 is shifted in the shift register 44 
at any time. Thereby, a path select signal can be 
stored in the RAM 14. In this case, the path select 
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signal shows which path of two paths has been selected, 
and is used in executing the trace-back after the 
end of the ACS operation. 

For example, in a case where the bit width of 
5 the shift register 43 and that of the shift register 
44 are s i ng 1 e - p r e c i s i on data widths, the path select 
signal can be stored when the ACS operation 
corresponding to a number of s i ng 1 e - p r e c i s i on bits 
are executed. 

10 Thus, according to the tenth embodiment, the 

shift registers are used as storing means for 
executing the ACS operations and for storing the 
comparison result. Thereby, for example, the shift 
registers can be used as an operation instruction 

15 for using a shift register of a division system. 
Therefore, in the case of implementing the processing 
unit in an LSI form, the chip area can be further 
decreased, and the manufacturing cost can be reduced. 
Note that the similar operation can be executed even 

20 if the dual port RAM is used as the storing section 
for storing the path metrics. 
(Eleventh Embodiment ) 

FIG. 2 5 is a block diagram showing the structure 
of the processing unit of the eleventh embodiment 
25 of the present invention. In the processing unit 
of FIG. 25, the same reference numerals are added 
to the portions common to the processing unit of FIG. 
24 and the explanation is omitted. 
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As compared with the processing unit of FIG. 
24, in the processing unit of FIG. 25, the input 
register 17 swaps the path metric data so as to be 
input from the bus 2. Then, 4:2 compressor 40 
5 directly inputs the branch metric data without 
swapping the branch metric data, and a negate value 
of the comparison result of the comparator 26 is 
shifted in the shift register 44. 

The processing unit of FIG. 25 is suitable for 
10 the operation processing of the pipe line structure 
shown in FIG. 15. 

For executing the ACS operation in this 
embodiment, two path metrics { A , B } a r e d i r e c 1 1 y i npu t 
to the input register 16 as { A , B } , and input to the 
15 input register 17 as { B , A } in a swapped state. After 
that, two branch metrics are input from the swap 
circuit 18 to the 4:2 compressor 40 as {C} and {~D}, 
and two path metrics are input from the input register 
17 to the 4:2 compressor 40 as {B} and {~~A}, and {A + B} 
20 and {B+C} are output. 

Then, the comparator 26 inputs two outputs {A + B} 
and {B+C} so as to calculate {A+D-B-C}. 

On the other hand, the doub 1 e - p r e c i s i on adder 
42 inputs two branch metrics as {C, D} from the swap 
25 circuit 18, and inputs two path metrics as {B, A} 
from the input register. Then, {B + C} and {A+D} are 
simultaneously computed in parallel, and output to 
the selecting section 12 in the form of {B + C, A + D}. 
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Then, the MSB of the comparison result is output 
to the selecting section 12 from the comparator 26, 
and the MSB of the negate value of the comparison 
result is output to the shift register 44. 
5 Thus, according to the eleventh embodiment, one 

of the input registers for storing two path metrics 
swaps data to be input. As a result, since the need 
of the swapping operation at the input of the 4:2 
compressor 40 and that of the double-precision adder 

10 4 2 can be eliminated at the operation execution (EX) 
stage, the ACS operation can be executed at higher 
speed. Note that the similar operation can be 
executed even if the dual port RMA is used as the 
means for storing the path metrics. 

15 (Twelfth Embodiment) 

FIG. 26 is a block diagram showing the structure 
of a mobile station apparatus in the twelfth 
embodiment. A mobile station apparatus 45 shown in 
FIG. 26 comprises an antenna section 46 for both 

20 reception and transmission, a radio section 47 having 
a receiving section 48 and a transmitting section 
49, a base band signal processing section 50 for 
executing a signal modulation and demodulation, and 
a signal coding and decoding, a speaker 58 for 

25 outputting a sound, a microphone 59 for inputting 
a sound, a data input/output section 60 for 
inputting/ outputting data to be received and 
transmitted from/to an outer device, a display 
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section 61 for displaying an operation state, an 
operation section 62 such as a 10-button keypad, and 
a control section 63 for controlling the respective 
parts . 

5 The base band signal processing section 50 

comprises a demodulation section 51 for demodulating 
a received signal, a modulation section 52 for 
modulating a transmitted signal, and a DSP 53 of one 
chip . 

10 The DSP 53 comprises a Viterbi decoding section 

55, which is formed by any one of the processing units 
of the first to eleventh embodiments, a convolutional 
coding section 56 for convolutional coding the 
transmitted signal, a voice codec section 57 for 

15 executing a voice signal coding and decoding, and 
a timing control section 54 for controlling timing 
for sending the received signal to the Viterbi 
decoding section 55 from the demodulation section 
51 and timing for sending the transmitted signal to 

20 the modulation section 52 from the convolutional 
coding section 56. These devices are formed by 
software, respectively. 

The control section 63 displays a signal input 
from the operation section 62 to the display section 

25 61, receives the signal input from the operation 
section 62. Then, the control section 63 outputs 
a control signal for performing a calling operation 
to the antenna section 46, the radio section 47, and 
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the base band signal processing section 50 in 
accordance with a communication sequence. 

If the voice is transmitted from the mobile 
station apparatus 45, the voice signal input from 
5 the microphone 59 is AD converted by an AD converter 
(not shown) . Then, the converted signal is coded 
by the voice codec section 57 so as to be input to 
the convolut ional coding section 56. If data is 
transmitted, data input from the outer section is 

10 input to the convolut ional coding section 56 through 
the data input/output section 60. 

Data input to the c on vo 1 u t i on a 1 coding section 
56 is convolut ional coded, and the timing control 
section 54 sorts data and adjusts the transmission 

15 output timing so as to output data to the modulation 
section 52. Data input to the modulation section 
52 is digitally modulated, AD converted, and output 
to the transmitting section 49 of the radio section 
47. Data input to the transmitting section 49 is 

20 converted to radio signals, and output to the antenna 
section 46 as radio waves. 

On the other hand, for outputting data received 
by the mobile station apparatus 45, the radio waves 
received by the antenna portion 46 are received by 

25 the receiving section 48 of the radio potion 47, AD 
converted, and output to the demodulation section 
51 of the base band signal processing section 50. 
Data demodulated by the demodulation section 51 is 
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sorted by the timing control section 54, thereafter 
being decoded by the Viterbi decoding section 55. 

In the case of voice communications, decoded 
data is voice decoded by the voice codec section 57, 
and is DA converted, thereafter being output to the 
speaker 58 as a voice. In the case of data 
communications , data decoded by the Viterbi decoding 
section 55 is output to the outer section through 
the data input/output section 60. 

In the mobile station apparatus 4 5 of the twelfth 
embodiment, the respective parts of the Viterbi 
decoding section 55, the c o n vo 1 u t i on a 1 coding 
section 56, the vo i ce code c s e ct i on 5 7 , andthetiming 
control section 54 are formed by software of one chip 
DSP 53. Thus, the mobile station apparatus 45 can 
be assembled by a small number of parts. Also, since 
the Viterbi decoding section 55 is formed by any one 
of the processing units of the first to eleventh 
embodiments, the update of two path metrics can be 
realized with one machine cycle in the pipe line 
processing using DSP 53. Thereby, the high speed 
ACS operation of the Viterbi decoding using DSP 53 
can be realized with relative a small amount of 
processing. 

In this embodiment, the demodulation section 
51 and the modulation section 52 are shown to be 
differentiated from DSP 53. However, these devices 
can be formed by software of DSP 53. Also, the DSP 
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of the sixth embodiment can be used as DSP 53, and 
the convolut ional coding section 56, the voice codec 
section 57, and the timing control section 54 can 
be formed by the other parts, respectively. 
5 (Thirteenth Embodiment) 

FIG. 27 is a block diagram showing the structure 
of a mobile station apparatus in the thirteenth 
embodiment. In a mobile station apparatus 45A of 
FIG. 27, the same reference numerals are added to 
10 the portions common to the portions of the mobile 
station apparatus 45 of FIG. 26, and the explanation 
i s omitted . 

As compared with the mobile station apparatus 
45 of FIG. 26, in the mobile station apparatus 45A 

15 of FIG. 27, a spreading section 65 is provided in 
a modulation section 52A, and a despreading section 
64 is provided in a demodulation section 51A, so that 
a base band signal processing section 50A of a CDMA 
communication system is formed. In the case of the 

20 CDMA communication system, in some cases, a RAKE 
receiving section, in which a plurality of fingers 
selected from a delay profile are adjusted to each 
other, is included in the timing control section 54. 
Thus, in the mobile station apparatus 45A in 

25 the thirteenth embodiment, the despreading section 
64 is provided in the demodulation section 51A and 
the spreading section 65 is provided in the modulation 
section 52A. Thereby, the mobile station apparatus 
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45A of the thirteenth embodiment can be applied to 
the CDMA communication system. 
(Fourteenth Embodiment) 

FIG. 2 8 is a block diagram showing the structure 
5 of a base station apparatus in the fourteenth 
embodiment . 

In FIG. 28, a base station apparatus 68 of the 
fourteenth embodiment comprises the antenna section 
46 having an antenna 66 for receiving and an antenna 

10 67 for transmitting, the radio section 47 having the 
receiving section 48 and the transmitting section 
49, a base band signal processing section 69 for 
executing a signal modulation and demodulation and 
a signal coding and decoding, the data input/output 

15 section 60 for inputting/outputting data to be 
received and transmitted from/to a cable network, 
and the control section 63 for controlling the 
respective parts. 

The base band signal processing section 69 

20 comprises the demodulation section 51 for 
demodulating the received signal, the modulation 
section 52 for modulating the transmitted signal, 
and one chip DSP 53A. The DSP 53A comprises the 
Viterbi decoding section 55, which is formed by any 

25 one of the processing units of the first to eleventh 
embodiments, the c o n v o 1 u t i o n a 1 coding section 56 for 
convolutional coding the transmitted signal, and the 
timing control section 54 for controlling timing for 



sending the received signal to the Viterbi decoding 
section 55 from the demodulation section 51 and timing 
for sending the transmitted signal to the modulation 
section 52 from the convo lu t i ona 1 coding section 56. 
5 These devices are formed by software, respectively. 

When data is received to the base station 
apparatus 68 from the cable network, data is input 
to the convolut ional coding section 56 through the 
data input/output section 60. Then, data input to 

10 the convolut ional coding section 56 is c on vo 1 u t i on a 1 
coded, and the timing control section 54 sorts input 
data and adjusts the transmission output timing so 
as to output data to the modulation section 52 . Data 
input to the modulation section 52 is digitally 

15 modulated, AD converted, and is converted to radio 
signals by the transmitting section 49. Then, the 
radio signals are transmitted from the antenna 
section 46 as radio waves. 

On the other hand, if. data is received to the 

20 base station apparatus 68 from the radio network, 
the radio waves received by the antenna portion 46 
are AD converted by the receiving section 48 and 
demodulated by the demodulation section 51 of the 
base band signal processing section 69. Demodulated 

25 data is sorted by the timing control section 54, and 
decoded by the Viterbi decoding section 55, 
thereafter being output to the cable network via the 
data input/output section 60. 
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In the base station apparatus 68 of the 
fourteenth embodiment, the respective parts of the 
Viterbi decoding section 55, the c on v o 1 u t i o n a 1 
coding section 56, and the timing control section 
5 5 4 are formed by software of one chip DSP 53A. Thus, 
the base station apparatus 68 can be assembled by 
a small number of parts. Also, since the Viterbi 
decoding section 55 is formed by any one of the 
processing units of the first to eleventh embodiments, 

10 the update of two path metrics can be realized with 
one machine cycle in the pipe line processing using 
DSP 53A. Thereby, the high speed ACS operation of 
the Viterbi decoding using DSP 53A can be realized 
with relatively a small amount of processing. 

15 In this embodiment, the demodulation section 

51 and the modulation section 52 are shown to be 
differentiated from DSP 53A . However, thesedevices 
can be formed by software of DSP 53A. Also, the DSP 
of the sixth embodiment can be used as DSP 53A, and 

20 the convolut ional coding section 56, the voice codec 
section 57, and the timing control section 54 can 
be formed by the other parts, respectively. 
(Fifteenth Embodiment) 

FIG. 2 9 is a block diagram showing the structure 
25 of a base station apparatus in the fifteenth 
embodiment. In a base station apparatus 68A of FIG. 
29, the same reference numerals are added to the 
portions common to the portions of the base station 
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apparatus 68 of FIG. 28, and the explanation is 
omitted . 

As compared with the mobile station apparatus 
45 of FIG. 26, in the mobile station apparatus 45A 
of FIG. 27, the spreading section 65 is provided in 
the modulation section 52A, and the despreading 
section 64 is provided in the demodulation section 
51A, so that the base band signal processing section 
50A of the CDMA communication system is formed. In 
the case of the CDMA communication system, in some 
cases, the RAKE receiving section, in which the 
plurality of fingers selected from the delay profile 
are adjusted to each other, is included in the timing 
control section 54. 

Thus, in the base station apparatus 68A of the 
fifteenth embodiment, the despreading section 64 is 
provided in the demodulation section 51A and the 
spreading section 65 is provided in the modulation 
section 52 A. Thereby, the base station apparatus 
68A of the fifteenth embodiment can be applied to 
the CDMA communication system. 

As mentioned above, the update of two path 
metrics can be realized with one machine cycle in 
the pipe line processing using DSP. Thereby, the 
high speed ACS operation of the Viterbi decoding using 
DSP can be realized with relative a small amount of 
processing. This makes it possible to downsize the 
portable terminal , reducing the weight , lowering the 
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cost, and increasing the life of a batter 



